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1 Query optimization in distributed networks of autonomous database systems 
Fragkiskos Pentaris, Yannis Ioannidis 

June 2006 ACM Transactions on Database Systems (TODS), volume 31 issue 2 
Publisher: ACM Press 

Full text available: || pdf(1.55 MB) Additional Information: full citation , abstract , references , index terms 

Large-scale distributed environments, where each node is completely autonomous and 
offers services to its peers through external communication, pose significant challenges to 
query processing and optimization. Autonomy is the main source of the problem, as it 
results in lack of knowledge about any particular node with respect to the information it 
can produce and its characteristics, for example, cost of production or quality of produced 
results. In this article, inspired by e-commerce technolog ... 

Keywords: Query optimization 

2 Query processing for relational data: Query evaluation using overlapping views: j 
4k completeness and efficiency 

^ Gang Gou, Maxim Kormilitsin, Rada Chirkova 

June 2006 Proceedings of the 2006 ACM SIGMOD international conference on 

Management of data SIGMOD '06 
Publisher: ACM Press 

Full text available: g.pdf(383.41 KB), Additional Information: full citation , abstract , references , index terms 

We study the problem of finding efficient equivalent view-based rewritings of relational 
queries, focusing on query optimization using materialized views under the assumption 
that base relations cannot contain duplicate tuples. A lot of work in the literature 
addresses the problems of answering queries using views and query optimization. 
However, most of it proposes solutions for special cases, such as for conjunctive queries 
(CQs) or for aggregate queries only. In addition, most of it addresses ... 



Keywords: materialized views, query optimization, rewriting queries using views 



A case for dynamic view management 
Yannis Kotidis, Nick Roussopoulos 
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December 2001 ACM Transactions on Database Systems (TODS), Volume 26 issue 4 
Publisher: ACM Press 

Full text available: I B pdf(892.57 KB) Additional Information: full citation , abstract , references , citings , index 

terms , review 

Materialized aggregate views represent a set of redundant entities in a data warehouse 
that are frequently used to accelerate On-Line Analytical Processing (OLAP). Due to the 
complex structure of the data warehouse and the different profiles of the users who 
submit queries, there is need for tools that will automate and ease the view selection and 
management processes. In this article we present DynaMat, a system that manages 
dynamic collections of materialized aggregate views in a data warehous ... 

Keywords: Data cube, OLAP, data warehousing, materialized views 

Optimization of dynamic query evaluation plans 
Richard L. Cole, Goetz Graefe 

May 1994 ACM SIGMOD Record , Proceedings of the 1994 ACM SIGMOD international 

conference on Management of data SIGMOD '94, Volume 23 issue 2 
Publisher: ACM Press 

Full text available: fi §pdf(1.45MB) Additional Information: full citation, abstract , references , citings , index 

terms 

Traditional query optimizers assume accurate knowledge of run-time parameters such as 
selectivities and resource availability during plan optimization, i.e., at compile time. In 
reality, however, this assumption is often not justified. Therefore, the "static" plans 
produced by traditional optimizers may not be optimal for many of their actual run-time 
invocations. Instead, we propose a novel optimization model that assigns the bulk of the 
optimization effort to compile-time and ... 

5 Query Optimjzatjon iJdnt optimization of cost and covera ge of query plans in data 
* integration 

Zaiqing Nie, Subbarao Kambhampati 

October 2001 Proceedings of the tenth international conference on Information and 
knowledge management 

Publisher: ACM Press 

Full text available: fllpdff1.74 MB) Additional Information: full citation, abstract, references , citings , index 

terms 

Existing approaches for optimizing queries in data integration use decoupled strategies- 
attempting to optimize coverage and cost in two separate phases. Since sources tend to 
have a variety of access limitations, such phased optimization of cost and coverage can 
unfortunately lead to expensive planning as well as highly inefficient plans. In this paper 
. we present techniques for joint optimization of cost and coverage of the query plans. Our 
algorithms search in the space of parallel query plans ... 

Optimiz ation of query streams usin g semantic prefetching j 
Ivan T. Bowman, Kenneth Salem " 
December 2005 ACM Transactions on Database Systems (TODS), Volume 30 issue 4 
Publisher: ACM Press 

Full text available: g^dgUO MB) Additional Information: full citation, abstract , references , ind ex terms 

Streams of relational queries submitted by client applications to database servers contain 
patterns that can be used to predict future requests. We present the Scalpel system, 
which detects these patterns and optimizes request streams using context-based 
predictions, of future requests. Scalpel uses its predictions to provide a form of semantic 
prefetching, which involves combining a predicted series of requests into a single request 
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that can be issued immediately. Scalpel's semantic prefetching ... 
Keywords: Prefetching, query streams 



An adaptive query execution system for data integration 

Zachary G. Ives, Daniela Florescu, Marc Friedman, Alon Levy, Daniel S. Weld 

June 1999 ACM SIGMOD Record , Proceedings of the 1999 ACM SIGMOD international 

conference on Management of data SIGMOD '99, volume 28 issue 2 
Publisher: ACM Press 

Full text available: fi Qpdf(1.59 MB ) Additional Information: full citation , abstract , references , citings , index 

terms 

Query processing in data integration occurs over network-bound, autonomous data 
sources. This requires extensions to traditional optimization and execution techniques for 
three reasons: there is an absence of quality statistics about the data, data transfer rates 
are unpredictable and bursty, and slow or unavailable data sources can often be replaced 
by overlapping or mirrored sources. This paper presents the Tukwila data integration 
system, designed to support adaptivity at its core using ... 
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8 Special issue in parallelism in database systems: Considering data skew factor in 
multi-way join query optimization for parallel execution 
Kien A. Hua, Yo Lung Lo, Honesty C. Young 

July 1993 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 2 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: gpdf(143MB^ Additional Information: full citation , abstract , references , citings 

A consensus on parallel architecture for very large database management has emerged. 
This architecture is based on a shared-nothing hardware organization. The computation 
model is very sensitive to skew in tuple distribution, however. Recently, several parallel 
join algorithms with dynamic load balancing capabilities have been proposed to address 
this issue, but none of them consider multi-way join problems. In this article we propose 
a dynamic load balancing technique for multi-way joins, and i ... 

Keywords: load balancing, multi-way join, parallel-database computer, query 
optimization 




Industrial session: query processing and optimization: Analyzing plan diagrams of 
database guerv optimizers 
Naveen Reddy, Jayant R. Haritsa 

August 2005 Proceedings of the 31st international conference on Very large data 

bases VLDB '05 
Publisher: VLDB Endowment 

Full text available: pdf(362.80 KB) Additional Information: full citation , abstract , references , index terms 

A "plan diagram" is a pictorial enumeration of the execution plan choices of a database 
query optimizer over the relational selectivity space. In this paper, we present and 
analyze representative plan diagrams on a. suite of popular commercial query optimizers 
for queries based on the TPC-H benchmark. These diagrams, which often appear similar 
to cubist paintings, provide a variety of interesting insights, including that current 
optimizers make extremely fine-grained plan choices, which may often ... 

Join queries with external text sources: execution and optimization techniques 
Surajit Chaudhuri, Umeshwar Dayal, Tak W. Yan 

May 1995 ACM SIGMOD Record , Proceedings of the 1995 ACM SIGMOD international 
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conference on Management of data SIGMOD '95, volume 24 issue 2 
Publisher: ACM Press 

Full text available: i Bpdff1.49MB) Additional Information: full citation , abstract , references , citings, jndex 
m terms 

Text is a pervasive information type, and many applications require querying over text 
sources in addition to structured data. This paper studies the problem of query processing 
in a system that loosely integrates an extensible database system and a text retrieval 
system. We focus on a class of conjunctive queries that include joins between text and 
structured data, in addition to selections over these two types of data. We adapt 
techniques from distributed query processing and introduce a novel ... 

11 Multiple-query optimization 
Timos K. Sellis 

March 1988 ACM Transactions on Database Systems (TODS), Volume 13 issue l 
Publisher: ACM Press 

Full text available: Hlpdf(2.19 MB) Additional Information: full citation, abstract , references , citings, index 

terms , review 

Some recently proposed extensions to relational database systems, as well as to 
deductive database systems, require support for multiple-query processing. For example, 
in a database system enhanced with inference capabilities, a simple query involving a rule 
with multiple definitions may expand to more than one actual query that has to be run 
over the database. It is an interesting problem then to come up with algorithms that 
process these queries together instead of one query at a time. Th ... 

12 Research sessions: query optimization: Robust query processing through 

progressive optimization 

^ Volker Markl, Vijayshankar Raman, David Simmen, Guy Lohman, Hamid Pirahesh, Miso 
Cilimdzic 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: | § pdf(331.15 KB! Additional Information: full citation , abstract , references 

Virtually every commercial query optimizer chooses the best plan for a query using a cost 
model that relies heavily on accurate cardinality estimation. Cardinality estimation errors 
can occur due to the use of inaccurate statistics, invalid assumptions about attribute 
independence, parameter markers, and so on. Cardinality estimation errors may cause 
the optimizer to choose a sub-optimal plan. We present an approach to query processing 
that is extremely robust because it is able to detect and re ... 

1 3 Cache investment: integrating query optimization and dis tributed da ta placement 
" Donald Kossmann, Michael J. Franklin, Gerhard Drasch, Wig Ag 

December 2000 ACM Transactions on Database Systems (TODS), Volume 25 issue 4 
Publisher: ACM Press 

Full text available: #1 pdf(210.67 KB) Additiona l Information: full citation , abstract , references , citings , index 
lilj-fc^-J s terms 

Emerging distributed query-processing systems support flexible execution strategies in 
which each query can be run using a combination of data shipping and query shipping. As 
in any distributed environment, these systems can obtain tremendous performance and 
availability benefits by employing dynamic data caching. When flexible execution and 
dynamic caching are combined, however, a circular dependency arises: Caching occurs as 
a by-product of query operator placement, but query operator pi ... 

Keywords: cache investment, caching, client-server database systems, data shipping, 
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14 Research sessions: query processing I: Exploiting statistics on query expressions for 
^ optimization 

■ Nicolas Bruno, Surajit Chaudhuri 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data SIGMOD '02 

Publisher: ACM Press 

Full text available: i ll pdf(1.33 MB) Additional Information: full citation , abstract , references , citings , index 

Statistics play an important role in influencing the plans produced by a query optimizer. 
Traditionally, optimizers use statistics built over base tables and assume independence 
between attributes while propagating statistical information through the query plan. This 
approach can introduce large estimation errors, which may result in the optimizer 
choosing inefficient execution plans. In this paper, we show how to extend a generic 
optimizer so that it also exploits statistics built on expression ... 

15 Research papers: optimization: Towards a robust query optimizer: a principled and 
practical approach 
Brian Babcock, Surajit Chaudhuri 

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: "g|pdf(551.42 KB) Additional Information: full citation, abstract , references 

Research on query optimization has focused almost exclusively on reducing query 
execution time, while important qualities such as consistency and predictability have 
largely been ignored, even though most database users consider these qualities to be at 
least as important as raw performance. In this paper, we explore how the query 
optimization process can be made more robust, focusing on the important subproblem of 
cardinality estimation. The robust cardinality estimation technique that we propo ... 

16 Query processing: Estimating compilation time of a query optimizer 
Ihab F. Ilyas, Jun Rao, Guy Lohman, Dengfeng Gao, Eileen Lin 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: IB pdf(292.76 KB) Additional Information: full citation, abstract , references , citings, index 

ten Ens. 

A query optimizer compares alternative plans in its search space to find the best plan for 
a given query. Depending on the search space and the enumeration algorithm, optimizers 
vary in their compilation time and the quality of the execution plan they can generate. 
This paper describes a compilation time estimator that provides a quantified estimate of 
the optimizer compilation time for a given query. Such an estimator is useful for 
automatically choosing the right level of optimization in comme ... 

17 Adaptable query optimization and evaluation in temporal middleware 
^ Giedrius Slivinskas, Christian S. Jensen, Richard Thomas Snodgrass 

May 2001 ACM SIGMOD Record , Proceedings of the 2001 ACM SIGMOD international 

conference on Management of data SIGMOD '01, volume 30 issue 2 
Publisher: ACM Press 

Full text available: ■§) pdf ( 232.65 KB ) Additional Information: full citation , abstract , 

terms 
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Time-referenced data are pervasive in most real-world databases. Recent advances in 
temporal query languages show that such database applications may benefit substantially 
from built-in temporal support in the DBMS. To achieve this, temporal query optimization 
and evaluation mechanisms must be provided, either within the DBMS proper or as a 
source level translation from temporal queries to conventional SQL. This paper proposes a 
new approach: using a middleware component on top of a conventio ... 



18 Research sessions: non-standard query processing: Optimization of query streams 
usin g semantic prefetching 
Ivan T. Bowman, Kenneth Salem 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: g pdf(224.40 KB) Additional Information: full citation , abstract , references 

Streams of relational queries submitted by client applications to database servers contain 
patterns that can be used to predict future requests. We present the Scalpel system, 
which detects these patterns and optimizes request streams using context-based 
predictions of future requests. Scalpel uses its predictions to provide a form of semantic 
prefetching, which involves combining a predicted series of requests into a single request 
that can be issued immediately. Scalpel's semantic prefetching ... 

19 Query processing and optimization in Oracle Rdb 
Gennady Antoshenkov, Mohamed Ziauddin 

December 1996 The VLDB Journal — The International Journal on Very Large Data 

- Bases, Volume 5 Issue 4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: g pdf(92.62 KB) Additional Information: full citation , abstract , citings , index terms 

This paper contains an overview of the technology used in the query processing and 
optimization component of Oracle Rdb, a relational database management system 
originally developed by Digital Equipment Corporation and now under development by 
Oracle Corporation. Oracle Rdb is a production system that supports the most demanding 
database applications, runs on multiple platforms and in a variety of environments. 

Keywords: Dynamic optimization, Optimizer, Query transformation, Relational database, 
Sampling 



20 Multiway spatial joins 

Nikos Mamoulis, Dimitris Papadias 

December 2001 ACM Transactions on Database Systems (TODS), Volume 26 issue 4 
Publisher: ACM Press 

Full text available: *B pdf( 2.04 MB) Additional Information: full citation , abstract , references , citings , index 
l^. _„ terms , review 

Due to the evolution of Geographical Information Systems, large collections of spatial 
data having various thematic contents are currently available. As a result, the interest of 
users is not limited to simple spatial selections and joins, but complex query types that 
implicate numerous spatial inputs become more common. Although several algorithms 
have been proposed for computing the result of pairwise spatial joins, limited work exists 
on processing and optimization of multiway spatial join ... 

Keywords: Multiway joins, query processing, spatial joins 
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1 Materialized view selection and maintenance using multi-query optimization 
Hoshi Mistry, Prasan Roy, S. Sudarshan, Krithi Ramamritham 

May 2001 ACM SIGMOD Record , Proceedings of the 2001 ACM SIGMOD international 

conference on Management of data SIGMOD '01, volume 30 issue 2 
Publisher: ACM Press 



Full text available: 



[pdfd 99.46 KB) Additional Information: full citation , abstract , references , citings , index 
' ! terms 



Materialized views have been found to be very effective at speeding up queries, and are 
increasingly being supported by commercial databases and data warehouse systems. 
However, whereas the amount of data entering a warehouse and the number of 
materialized views are rapidly increasing, the time window available for maintaining 
materialized views is shrinking. These trends necessitate efficient techniques for the 
maintenance of materialized views. 

In this paper, we show how to find an ... 



Cost-based optimization of decision support queries using transient-views 
Subbu N. Subramanian, Shivakumar Venkataraman 

June 1998 ACM SIGMOD Record , Proceedings of the 1998 ACM SIGMOD international 

conference on Management of data SIGMOD '98, Volume 27 issue 2 
Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings , index 
terms 



Full text available: 1 g pdfd. 58 MB) 



Next generation decision support applications, besides being capable of processing huge 
amounts of data, require the ability to integrate and reason over data from multiple, 
heterogeneous data sources. Often, these data sources differ in a variety of aspects such 
as their data models, the query languages they support, and their network protocols. 
Also, typically they are spread over a wide geographical area. The cost of processing 
decision support queries in such a setting is quite high. Ho ... 

Efficient and extensible algorithms for multi query optimization 
Prasan Roy, S. Seshadri, S. Sudarshan, Siddhesh Bhobe 

May 2000 ACM SIGMOD Record , Proceedings of the 2000 ACM SIGMOD international 

conference on Management of data SIGMOD 'OO, Volume 29 issue 2 
Publisher: ACM Press 



http://portal.acm.org/resultsxfm?coll=ACM&dl=ACM&CFID=3489922 10/31/2006 



Results (page 1): optimize execution plan query estimates overlap materialized view Page 2 of 7 
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terms 

Complex queries are becoming commonplace, with the growing use of decision support 
systems. These complex queries often have a lot of common sub-expressions, either 
within a single query, or across multiple such queries run as a batch. Multiquery 
optimization aims at exploiting common sub-expressions to reduce evaluation cost. Multi- 
query optimization has hither-to been viewed as impractical, since earlier algorithms were 
exhaustive, and explore a doubly exponential search space. 



Int... 

4 Query processing for rel ational data: Query evaluation using overlapping views: 
<g> completeness and efficiency 
w Gang Gou, Maxim Kormilitsin, Rada Chirkova 

June 2006 Proceedings of the 2006 ACM SIGMOD international conference on 

Management of data SIGMOD '06 
Publisher: ACM Press 

Full text available: ^,pdf(383.41 KB} Additional Information: full citation , abstract , references , index terms 

We study the problem of finding efficient equivalent view-based rewritings of relational 
queries, focusing on query optimization using materialized views under the assumption 
that base relations cannot contain duplicate tuples. A lot of work in the literature 
addresses the problems of answering queries using views and query optimization. 
However, most of it proposes solutions for special cases, such as for conjunctive queries 
(CQs) or for aggregate queries only. In addition, most of it addresses ... 

Keywords: materialized views, query optimization, rewriting queries using views 
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Research sessions: query processi ng I: Exploiting statistics on query expressions for 
optimization 

Nicolas Bruno, Surajit Chaudhuri 

June 2002 Proceedings of the 2002 ACM SIGMOD international conference on 
Management of data SIGMOD '02 

Publisher: ACM Press 

Full text available: ■gpdfd.33 MB^ Additional Information: full citation, abstract , references , citings , index 

terms 

Statistics play an important role in influencing the plans produced by a query optimizer. 
Traditionally, optimizers use statistics built over base tables and assume independence ' 
between attributes while propagating statistical information through the query plan. This 
approach can introduce large estimation errors, which may result in the optimizer 
choosing inefficient execution plans. In this paper, we show how to extend a generic 
optimizer so that it also exploits statistics built on expression ... 

Query evaluation techniques for large databases 
Goetz Graefe 

June 1993 ACM Computing Surveys (CSUR), Volume 25 issue 2 
Publisher: ACM Press 

Full text available: pdf(9.37 MB^ Additional Information: full citation, abstract, references , citings, index 

terms , review 

Database management systems will continue to manage large data volumes. Thus, 
efficient algorithms for accessing and manipulating large sets and sequences will be 
required to provide acceptable performance. The advent of object-oriented and extensible 
database systems will not solve this problem. On the contrary, modern data models 
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exacerbate the problem: In order to manipulate large sets of complex objects as 
efficiently as today's database systems manipulate simple records, query-processi ... 

Keywords: complex query evaluation plans, dynamic query evaluation plans, extensible 
database systems, iterators, object-oriented database systems, operator model of 
parallelization, parallel algorithms, relational database systems, set-matching algorithms, 
sort-hash duality 



The state of the art in distributed query processing 
Donald Kossmann 

December 2000 ACM Computing Surveys (CSUR), Volume 32 Issue 4 
Publisher: ACM Press 

Full text available: f B pdf(455.39 KB) Additional Information: full citation , abstract, references, citings, index 

terms 

Distributed data processing is becoming a reality. Businesses want to do it for many 
reasons, and they often must do it in order to stay competitive. While much of the 
infrastructure for distributed data processing is already there (e.g., modern network 
technology), a number of issues make distributed data processing still a complex 
undertaking: (1) distributed systems can become very large, involving thousands of 
heterogeneous sites including PCs and mainframe server machines; (2) the stat ... 

Keywords: caching, client-server databases, database application systems, 
dissemination-based information systems, economic models for query processing, 
middleware, multitier architectures, query execution, query optimization, replication, 
wrappers 



Query optimization in distributed networks of autonomous database systems 
Fragkiskos Pentaris, Yannis Ioannidis 

June 2006 ACM Transactions on Database Systems (TODS), Volume 31 issue 2 
Publisher: ACM Press 

Full text available: || pdf(1.55 MB) Additional Information: full citation , abstract , references , index terms 

Large-scale distributed environments, where each node is completely autonomous and 
offers services to its peers through external. communication, pose significant challenges to 
query processing and optimization. Autonomy is the main source of the problem, as it 
results in lack of knowledge about any particular node with respect to the information it 
can produce and its characteristics, for example, cost of production or quality of produced 
results. In this article, inspired by e-commerce technolog ... 

Keywords: Query optimization 

9 Researchjaeers: O LAP: Efficient com putatio n o f multiple group by queries 
Zhimin Chen, Vivek Narasayya 

June 2005 Proceedings of the 2005 ACM SIGMOD international conference on 
Management of data 

Publisher: ACM Press 

Full text available: , ff| pdf(371.92 KB) Additional Information: full citation , abstract , references 

Data analysts need to understand the quality of data in the warehouse. This is often done 
by issuing many Group By queries on the sets of columns of interest. Since the volume of 
data in these warehouses can be large, and tables in a data warehouse often contain 
many columns, this analysis typically requires executing a large number of Group By 
queries, which can be expensive. We show that the performance of today's database 
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systems for such data analysis is inadequate. We also show that the pro ... 

10 Query processing: Estimating compilation time of a query optimizer 
Ihab F. Ilyas, Jun Rao, Guy Lohman, Dengfeng Gao, Eileen Lin 
June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: jjl pdf(292.76 KB) Additional Information: full citation, abstract, references , citings, index 
• IM ^ — - j terms 

A query optimizer compares alternative plans in its search space to find the best plan for 
a given query. Depending on the search space and the enumeration algorithm, optimizers 
vary in their compilation time and the quality of the execution plan they can generate. 
This paper describes a compilation time estimator that provides a quantified estimate of 
the optimizer compilation time for a given query. Such an estimator is useful for 
automatically choosing the right level of optimization in comme ... 

11 Answering queries using views: A survey 
Alon Y. Halevy 

December 2001 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 10 Issue 4 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(308.74 KB) Additional Information: full citation , abstract , citings , index terms 

The problem of answering queries using views is to find efficient methods of answering a 
query using a set of previously defined materialized views over the database, rather than 
accessing the database relations. The problem has recently received significant attention 
because of its relevance to a wide variety of data management problems. In query 
optimization, finding a rewriting of a query using a set of materialized views can yield a 
more efficient query execution plan. To support the separat ... 

Keywords: Data integration, Date warehousing, Materialized views, Query optimization, 
Survey, Web-site management 



12 A case for dynamic view management 
Yannis Kotidis, Nick Roussopoulos 

December 2001 ACM Transactions on Database Systems (TODS), Volume 26 issue 4 
Publisher: ACM Press 

Full text available* pdf(892 57 KB) Ac,d ' t ' onal Information: full citation , abstract , references , citings , index 
l^a-^-- 1 : terms , review 

Materialized aggregate views represent a set of redundant entities in a data warehouse 
that are frequently used to accelerate On-Line Analytical Processing (OLAP). Due to the 
complex structure of the data warehouse and the different profiles of the users who 
submit queries, there is need for tools that will automate and ease the view selection and 
management processes. In this article we present DynaMat, a system that manages 
dynamic collections of materialized aggregate views in a data warehous ... 

Keywords: Data cube, OLAP, data warehousing, materialized views 



13 Opti mization of query stream s usi ng semantic prefetching 
jgfcv Ivan T. Bowman, Kenneth Salem 

December 2005 ACM Transactions on Database Systems (TODS), volume 30 issue 4 

Publisher: ACM Press 



http://portal.acm.org/resu^ 10/31/2006 



Results (page 1): optimize execution plan query estimates overlap materialized view Page 5 of 7 



Full text available: g|pdf(1.10 MB) Additional Information: full citation , abstract , references . Index term s 

Streams of relational queries submitted by client applications to database servers contain 
patterns that can be used to predict future requests. We present the Scalpel system, 
which detects these patterns and optimizes request streams using context-based 
predictions of future requests. Scalpel uses its predictions to provide a form of semantic 
prefetching, which involves combining a predicted series of requests into a single request 
that can be issued immediately. Scalpel's semantic prefetching ... 

Keywords: Prefetching, query streams 

14 Cache investment: integrating query optimization and distributed data placement 
^ Donald Kossmann, Michael J. Franklin, Gerhard Drasch, Wig Ag 

▼ December 2000 ACM Transactions on Database Systems (TODS), Volume 25 issue 4 
Publisher: ACM Press 

Full text available: fg| Pdfl210.67 KB1 AdditionaI Information: full citation, abstract , references , citings , index 

terms 

Emerging distributed query-processing systems support flexible execution strategies in 
which each query can be run using a combination of data shipping and query shipping. As 
in any distributed environment, these systems can obtain tremendous performance and 
availability benefits by employing dynamic data caching. When flexible execution and 
dynamic caching are combined, however, a circular dependency arises: Caching occurs as 
a by-product of query operator placement, but query operator pi ... 

Keywords: cache investment, caching, client-server database systems, data shipping, 
dynamic data placement, query optimization, query shipping 

15 Query processing for relational data: Supporting ad-hoc ranking aggregates 
^ Chengkai Li, Kevin Chen-Chuan Chang, Ihab F. Ilyas 

June 2006 Proceedings of the 2006 ACM SIGMOD international conference on 

Management of data SIGMOD '06 
Publisher: ACM Press 

Full text available: l g,pdf(344 > 23 KB) Additional Information: full citation , abstract , references , index terms 

This paper presents a principled framework for efficient processing of ad-hoc top-k 
(ranking) aggregate queries, which provide the k groups with the highest aggregates as 
results. Essential support of such queries is.lacking in current systems, which process the 
queries in a naive materialize-group-sort scheme that can be prohibitively inefficient. Our 
framework is based on three fundamental principles. The Upper-Bound Principle dictates 
the requirements of early pruning, and ... . 

Keywords: OLAP, aggregate query, decision support, ranking, top-k query processing 

16 Cost-based query scrambling for initial delays j 
Tolga Urhan, Michael J. Franklin, Laurent Amsaleg 

June 1998 ACM SIGMOD Record , Proceedings of the 1998 ACM SIGMOD international 

conference on Management of data SIGMOD '98, volume 27 issue 2 
Publisher: ACM Press 

Full text available: , S) pdf(1.81 MB) Addition a' Information: full citation, abstract , references , citings , index 

terms 

Remote data access from disparate sources across a wide-area network such as the 
Internet is problematic due to the unpredictable nature of the communications medium 
and the lack of knowledge about the load and potential delays at remote sites. Traditional, 
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static, query processing approaches break down in this environment because they are 
unable to adapt in response to unexpected delays. Query scrambling has been proposed 
to address this problem. Scrambling modifies query execution plans o ... 

17 Research sessions: data integration: Adapting to source properties in processing 
data integration queries 
Zachary G. Ives, Alon Y. Halevy, Daniel S. Weld 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: IS ) pdfd 97.27 KB) Additional Information: full citation , abstract , references 

An effective query optimizer finds a query plan that exploits the characteristics of the 
source data. In data integration, little is known in advance about sources' properties, 
which necessitates the use of adaptive query processing techniques to adjust query 
processing on-the-fly. Prior work in adaptive query processing has focused on 
compensating for delays and adjusting for mis-estimated cardinality or selectivity values. 
In this paper, we present a generalized architecture for adaptiv ... 

18 Query execution techniques for caching expensive methods 
Joseph M. Hellerstein, Jeffrey F. Naughton 

June 1996 ACM SIGMOD Record , Proceedings of the 1996 ACM SIGMOD international 

conference on Management of data SIGMOD '96, Volume 25 issue 2 
Publisher: ACM Press 

Full text available: « pdfft. 53 MB) Additional Information: full citation , abstract, references , citings, index 
^ terms 

Object-Relational and Object-Oriented DBMSs allow users to invoke time-consuming 
("expensive") methods in their queries. When queries containing these expensive 
methods are run on data with duplicate values, time is wasted redundantly computing 
methods on the same value. This problem has been studied in the context of 
programming languages, where "memoization" is the standard solution. In the database 
literature, sorting has been proposed to deal with this problem. We compare these 
approaches al ... 

19 Research sessions: non-standard query processing: Optimization of query streams 
using semantic prefetching 
Ivan T. Bowman, Kenneth Salem 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 

Management of data 
Publisher: ACM Press 

Full text available: ||| pdf(224.40 KB) Additional Information: full citation , abstract , references 

Streams of relational queries submitted by client applications to database servers contain 
patterns that can be used to predict future requests. We present the Scalpel system, 
which detects these patterns and optimizes request streams using context-based 
predictions of future requests. Scalpel uses its predictions to provide a form of semantic 
prefetching, which involves combining a predicted series of requests into a single request 
that can be issued immediately. Scalpel's semantic prefetching ... 

20 TinyDB: an acquisitional query processing system for sensor networks 
^ Samuel R. Madden, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong 

>^ March 2005 ACM Transactions on Database Systems (TODS), volume 30 issue l 
Publisher: ACM Press 

Full text available: l f|j) pdf(1.67 MB) Additional Information: full citation , abstract , references , index terms 
We discuss the design of an acquisitional query processor for data collection in sensor 
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networks. Acquisitional issues are those that pertain to where, when, and how often data 
is physically acquired {sampled) and delivered to query processing operators. By focusing 
on the locations and costs of acquiring data, we are able to significantly reduce power 
consumption over traditional passive systems that assume the a priori existence of data 
We discuss simple extensions to SQL for controlli ... 

Keywords: Query processing, data acquisition, sensor networks 
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